Séparation de sources audio en milieu réverbérant : Factorisation en matrices non-négatives et représentation temporelle du mélange convolutif

Authors

  • Simon Leglaive
  • Roland Badeau
  • Gaël Richard
Abstract

This paper addresses the problem of multichannel audio source separation in under-determined reverberant mixtures. We target a semi-blind scenario in which the mixing filters are assumed to be known. The proposed method works directly with the time-domain mixture signals. This approach makes it possible to accurately represent the convolutive mixing process and is therefore suitable for the separation of highly reverberant mixtures. The source signals are represented in the modified discrete cosine transform (MDCT) domain with a Gaussian model based on non-negative matrix factorization (NMF). Source inference is based on a variational expectation-maximization algorithm. We experimentally show the advantage of using a time-domain representation of the convolutive mixture and a source model based on NMF.
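
The two modelling ingredients named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (convolve, convolutive_mix, nmf_variance) and the toy data are hypothetical, and the variational expectation-maximization inference is not shown. The sketch assumes a noiseless mixture x_i(t) = sum_j sum_tau a_ij(tau) s_j(t - tau) with known filters a_ij, and a source variance in the transform domain of the form V[f][n] = sum_k W[f][k] H[k][n].

```python
def convolve(signal, filt):
    """Full linear convolution of two 1-D sequences (plain lists of floats)."""
    out = [0.0] * (len(signal) + len(filt) - 1)
    for t, s in enumerate(signal):
        for tau, a in enumerate(filt):
            out[t + tau] += a * s
    return out


def convolutive_mix(sources, filters):
    """Time-domain convolutive mixture (hypothetical helper).

    sources: list of J source signals.
    filters: filters[i][j] is the (assumed known) filter from source j
             to microphone i.
    Returns one mixture signal per microphone.
    """
    longest_source = max(len(s) for s in sources)
    longest_filter = max(len(f) for row in filters for f in row)
    n_out = longest_source + longest_filter - 1
    mixtures = []
    for row in filters:
        x = [0.0] * n_out
        for s, a in zip(sources, row):
            # Add the contribution of this source at this microphone.
            for t, value in enumerate(convolve(s, a)):
                x[t] += value
        mixtures.append(x)
    return mixtures


def nmf_variance(W, H):
    """Variance map V = W H, with non-negative W (F x K) and H (K x N)."""
    n_components = len(H)
    n_frames = len(H[0])
    return [
        [sum(W[f][k] * H[k][n] for k in range(n_components))
         for n in range(n_frames)]
        for f in range(len(W))
    ]


# Toy under-determined case: 3 sources, 2 microphones.
sources = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
filters = [
    [[1.0, 0.5], [1.0, 0.0], [0.5, 0.0]],   # filters to microphone 0
    [[0.5, 0.0], [1.0, 0.5], [1.0, 0.25]],  # filters to microphone 1
]
mixtures = convolutive_mix(sources, filters)

# Toy NMF variance model with K = 2 components.
V = nmf_variance([[1.0, 0.0], [0.5, 2.0]], [[2.0, 0.0], [1.0, 1.0]])
```

With three sources and only two mixture channels the system is under-determined, which is why a source prior such as the NMF variance model is needed to make the separation well posed.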

Similar resources

Non-Negative Matrix Factorization Applied to Auditory Scenes Classification

This master's thesis is dedicated to the automatic classification of auditory scenes using non-negative matrix factorization. Particular attention is paid to the performance achieved by non-negative matrix factorization in sound source detection. Our intuition was that a good classification could be achieved if we could efficiently detect the sources within auditory scenes. It appears on ...

Séparation aveugle d’un mélange convolutif de sources non linéaires par une approche hiérarchique

This paper deals with blind source separation of convolutive mixtures of nonlinear (and hence non-i.i.d.) sources. This general framework, although rarely treated in the literature, is of great importance in applications. An iterative method was proposed [10]; however, the deflation procedure induces an error accumulation effect. We propose a hierarchical approach inspired from [6] which is less s...

La prise en compte de la dimension temporelle dans la classification de données

Abstract. In a knowledge engineering context, the analysis of evolving relational data is a central question. Representing this type of data as an optimized graph facilitates its analysis and interpretation by non-expert users. However, these graphs can quickly become too complex to be studied as a whole; they must then be ...

Architecture et Outils pour la Recherche d'Evénements dans les Séquences Vidéo

ABSTRACT. The problem addressed here concerns the online indexing of multimedia data by searching for relevant excerpts, which may also be answers to specific queries. Our work focuses on the analysis of video sequences in order to detect predefined events. Since the search for these events is contextual, we propose an architecture and generic tools...

Well-posedness of the permutation problem in sparse filter estimation with lp minimization

Convolutive source separation is often done in two stages: 1) estimation of the mixing filters and 2) estimation of the sources. Traditional approaches suffer from the ambiguities of arbitrary permutations and scaling in each frequency bin of the estimated filters and/or the sources, and they are usually corrected by taking into account some special properties of the filters/sources. This paper...

Publication date: 2017